Modulation Cepstrum Discriminating between Speech and Environmental Noise
نویسندگان
چکیده
We introduce "the modulation cepstrum," a novel representation of an acoustic signal used to distinguish speech signal and environmental noises. The modulation cepstrum was computed by taking the inverse Fourier transform of the logarithmic modulation spectrum, a spectral representation of the temporal dynamics of a sub-band. We calculated the center of gravity of accumulated the modulation cepstrum for eight seconds of an acoustic signal as an index. The experimental result showed that this index enabled us to discriminate between speech and noise signals.
منابع مشابه
Leveraging Jointly Spatial, Temporal and Modulation Enhancement in Creating Noise-robust Features for Speech Recognition
This paper presents to adopt various fusion types of spatial, temporal and modulation domain speech feature enhancement techniques in order to achieve superior speech recognition performance under noise-corrupted environments. With the mel-frequency cepstral coefficients (MFCC) as the standard speech feature representation, the spatial-domain techniques involve the short-time intra-frame featur...
متن کاملAM-FM Based Robust Speaker Identification in Babble Noise
Speech babble is one of the most challenging noise interference due to its speaker/speech like characteristics for speech and speaker recognition systems. Performance of such systems strongly degrades in the presence of background noise, like the babble noise. Existing techniques solve this problem by additional processing of speech signal to remove noise. In contrast to existing works, the aim...
متن کاملRobust Speech Recognition with MSC/DRA Feature Extraction on Modulation Spectrum Domain
This report introduces noise robust speech recognition and proposes advanced speech analysis techniques named MSC (Modulation Spectrum Control)/DRA (Dynamic Range Adjustment). The dynamic range of cepstrum obtained from noisy speech is usually smaller than that from the same speech without noise since some speech features are hidden in noise. This difference may cause recognition errors. Theref...
متن کاملFusion of Acoustic, Perceptual and Production Features for Robust Speech Recognition in Highly Non-stationary Noise
Improving the robustness of speech recognition systems to cope with adverse background noise is a challenging research topic. Extraction of noise robust acoustic features is one of the prominent methods used for incorporating robustness in speech recognition systems. Prior studies have proposed several perceptually motivated noise robust acoustic features, and the normalized modulation cepstral...
متن کاملPhysiologically Motivated Feature Extraction for Robust Automatic Speech Recognition
In this paper, a new method is presented to extract robust speech features in the presence of the external noise. The proposed method based on two-dimensional Gabor filters takes in account the spectro-temporal modulation frequencies and also limits the redundancy on the feature level. The performance of the proposed feature extraction method was evaluated on isolated speech words which are ext...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002